Avoiding the Curse of Dimensionality in Dynamic Stochastic Games∗
نویسندگان
چکیده
Discrete-time stochastic games with a finite number of states have been widely applied to study the strategic interactions among forward-looking players in dynamic environments. However, these games suffer from a “curse of dimensionality” since the cost of computing players’ expectations over all possible future states increases exponentially in the number of state variables. We explore the alternative of continuous-time stochastic games with a finite number of states and show that continuous time has substantial advantages. Most important, continuous time avoids the curse of dimensionality, thereby speeding up the computations by orders of magnitude in games with more than a few state variables. This much smaller computational burden greatly extends the range and richness of applications of stochastic games. ∗We thank Victor Aguirregabiria, Ken Arrow, Lanier Benkard, Michaela Draganska, Gautam Gowrisankaran, Rustam Ibragimov, Sarit Markovich, Ariel Pakes, Katja Seim, Gabriel Weintraub, and the participants of SITE 2004, the CRES IO Conference in 2005, and the Informs Annual Meeting in 2006 for their comments. Doraszelski gratefully acknowledges the hospitality of the Hoover Institution during the academic year 2006/07. An Online Appendix with supplemental material is available on the authors’ websites. †Cambridge, MA 02138, U.S.A., [email protected]. ‡Stanford, CA 94305-6010, U.S.A., [email protected].
منابع مشابه
Dynamic Stochastic Games with Sequential State-to-State Transitions
Discrete-time stochastic games with a finite number of states have been widely applied to study the strategic interactions among forward-looking players in dynamic environments. The model as written down by Ericson & Pakes (1995), Pakes & McGuire (1994, 2001) (hereafter, EP, PM1, and PM2), the subsequent literature (e.g., Gowrisankaran 1999, Fershtman & Pakes 2000, Benkard 2004), and in standar...
متن کاملThe Linear Programming Approach to Solving Large Scale Dynamic Stochastic Games∗
In this paper we introduce a new method to approximate Markov perfect equilibrium in large scale dynamic stochastic games that are not amenable to exact solution due to the curse of dimensionality. The method is based on an algorithm that iterates an approximate best response operator computed via the ‘approximate linear programming’ approach. We provide results that lend theoretical support to...
متن کاملReinforcement Learning in 2-players Games
The purpose of reinforcement learning system is to learn an optimal policy in general. However, in 2players games such as the othello game, it is important to acquire a penalty avoiding policy. In this paper, we are focused on formation of penalty avoiding policies based on the Penalty Avoiding Rational Policy Making algorithm [2]. In applying it to large-scale problems, we are confronted with ...
متن کاملLarge-scale games in large-scale systems
Many real-world problems modeled by stochastic games have huge state and/or action spaces, leading to the well-known curse of dimensionality. The complexity of the analysis of large-scale systems is dramatically reduced by exploiting mean field limit and dynamical system viewpoints. Under regularity assumptions and specific time-scaling techniques, the evolution of the mean field limit can be e...
متن کاملDesign and Experimentation of a Large Scale Distributed Stochastic Control Algorithm Applied to Energy Management Problems
The Stochastic Dynamic Programming method often used to solve some stochastic optimization problems is only usable in low dimension, being plagued by the curse of dimensionality. In this article, we explain how to postpone this limit by using High Performance Computing: parallel and distributed algorithms design, optimized implementations and usage of large scale distributed architectures (PC c...
متن کامل